Reinforcement learning

Results: 1147



#Item
411Mathematical optimization / Dynamic programming / Markov decision process / Stochastic control / Reinforcement learning / Linear programming / Simplex algorithm / Statistics / Operations research / Mathematics

A subexponential lower bound for Zadeh’s pivoting rule for solving linear programs and games Oliver Friedmann Department of Computer Science, University of Munich,

Add to Reading List

Source URL: files.oliverfriedmann.de

Language: English - Date: 2012-02-10 07:43:23
412Artificial intelligence / Cognitive science / Machine learning / Dynamic programming / Markov processes / Q-learning / Reinforcement learning / Markov decision process / Temporal difference learning / Statistics / Science / Learning

Balancing Anarchy and Central Control Individual vs. Joint Action Reinforcement Learning Daniel Claes June 18, 2010 Abstract

Add to Reading List

Source URL: michaelkaisers.com

Language: English - Date: 2012-04-29 08:03:28
413Optical fiber / Technology / Routing and wavelength assignment / Routing / Fiber-optic communication

Reinforcement Learning Based Routing in All-Optical Networks with Physical Impairments Yvan Pointurier and Fariba Heidari Department of Electrical and Computer Engineering McGill University, Montreal, QC Email: {yvan.poi

Add to Reading List

Source URL: www.tsp.ece.mcgill.ca

Language: English - Date: 2007-09-19 15:25:57
414Mind / Clicker training / Dog training / Animal training / Operant conditioning / Applied behavior analysis / Reinforcement / Classical conditioning / Learning / Behaviorism / Behavior / Ethology

Syllabus Introduction Welcome and Academy history Resources for students and grads People Progress through the program levels

Add to Reading List

Source URL: www.academyfordogtrainers.com

Language: English - Date: 2014-11-24 14:43:14
415Systems theory / Mathematical optimization / Operations research / Equations / Stochastic control / Reinforcement learning / Markov decision process / Bellman equation / Policy / Statistics / Control theory / Dynamic programming

Journal of Artificial Intelligence Research Submitted 3/13; publishedA Survey of Multi-Objective Sequential Decision-Making Diederik M. Roijers

Add to Reading List

Source URL: jair.org

Language: English - Date: 2013-10-18 15:20:49
416Algorithm / Machine learning / Marcus Hutter / Reinforcement learning

Journal of Artificial Intelligence Research Submitted 07/10; publishedA Monte-Carlo AIXI Approximation Joel Veness

Add to Reading List

Source URL: jveness.info

Language: English - Date: 2011-01-25 03:41:25
417Stochastic control / Control theory / Partially observable Markov decision process / Automated planning and scheduling / Markov decision process / Reinforcement learning / Macro / Algorithm / Statistics / Dynamic programming / Markov processes

Monte Carlo Value Iteration with Macro-Actions Zhan Wei Lim David Hsu Wee Sun Lee

Add to Reading List

Source URL: www.comp.nus.edu.sg

Language: English - Date: 2011-11-09 00:33:14
418Behavior / Artificial intelligence / Game theory / Markov chain / Reinforcement learning / Asymptotic analysis / Reinforcement / Power law / Psychology / Markov models / Statistics / Science

doi:j.geb

Add to Reading List

Source URL: www.luis.izqui.org

Language: English - Date: 2009-07-31 18:02:39
419Multi-agent systems / El Farol Bar problem / Rational choice theory / Economics / Agent-based model / Reinforcement learning / Knowledge / Game theory / Science / Artificial intelligence

The El Farol Bar Problem as an Iterated N-Person Game

Add to Reading List

Source URL: www.complex-systems.com

Language: English - Date: 2013-06-19 13:03:42
420Computational neuroscience / Cybernetics / TD-Gammon / Reinforcement learning / Temporal difference learning / Neural network / Dice / Genetic algorithm / Board game / Games / Machine learning / Backgammon

Coevolution of a Backgammon Player Jordan B. Pollack & Alan D. Blair Computer Science Department Volen Center for Complex Systems Brandeis University Waltham, MA 02254

Add to Reading List

Source URL: www.demo.cs.brandeis.edu

Language: English - Date: 1997-03-13 13:43:33
UPDATE